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EVENT SENSING AND META-ROUTTNG PROCESS AUTOMATION 
FIELD OF THE IN\'ENTION 

[01] The present invention relates to process automation and more particularly to sense and 
response automation for managing data warehousing. 

BACKGROUND OF THE INVENTION 

[02] Data warehousing, the collection and integration of large amounts of enterprise related 
information, is essential to generating reports for large entexprises. For example, analysts at an 
airline use such reports of ticket sales grouped by airport, day of week, and other criteria to 
determine which routes the airline should add or drop. Merchandizing analysts at a clothing 
company use reports of clothing sales by region and season to determine which stores need to be 
restocked with which items. These reports are based on data collected from many different 
sources, such as point-of-sale terminals, and aggregated into a large database, commonly called a 
"data warehouse." Once the relevant data has been amassed within the data warehouse, reports 
smnmarizing the data can be generated for companies to use. 

[03] The sources that feed information into the data warehouse, however, tend to be quite 
different from one another or "heterogeneous," and the process of delivering the information 
produced by a particular data source into the data warehouse is often specific to that particular 
data source. In small enterprises, it is often economically acceptable for the data to be loaded 
manually by an employee who formulates and executes the appropriate conmiands for converting 
the data into the proper format, moving the data into the proper location, and loading the data 
into the data warehouse. Larger enterprises, however, have so much data that an automated 
approach is more economically desirable. One conmion automated approach to loading data into 
a data warehouse is known as "job-based scheduUng," which involves scripting and mainframe 
technology that has been initially developed more than 30 or 40 years ago. 

[04] With job-based scheduling, an information technology (TT) engineer creates scripts that 
are executed on a periodic basis (e.g. nightly at 4:00 a.m.) and that control, at a detailed level. 
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how the data is loaded, based on the particular requirements and formats of the incoming data 
and data warehousing. For example, the script may include the appropriate platform-specific 
conamands for renaming a file containing the incoming data, causing the file to be moved to an 
appropriate location (e.g. directory or folder) on the computer system for the data warehousing, 
running specific data conversion routines to put the file of incoming data into a format the data 
warehouse is capable of processing, and invoking the data warehouse-specific conmiand to cause 
the data warehouse to load the reformatted file. 

[05] Moreover, each data source has its own formatting and may be refreshed (i.e., become 
available with new data) at different times and periodicities than those of other data sources. For 
example, one data source can generate new information hourly, while another is daily. As a 
result, job-based scheduling script writers must be able to personally account for the differing 
data formats and handle the asynchronicity of the arrival of the data at different times. 

[06] Handling these issues, however, requires much skill and careful coding with complex 
methods, such as blocking jobs, parent job handlers, and so forth. As mainframe and other large 
computing system age, the numbers of people competent to craft such job-scheduled scripts 
diminishes with every year. The logic to handle these issues is hard-coded into the various 
scripts that are performed by the job-based, and upgrading or adding new data sources and data 
warehouses often require extensive recoding efforts and additional expense and time. 

[07] Some extraction, transformation, and loading (ETL) systems have been developed to 
address these complexities, but these ETL systems are often invasive, requiring tight integration 
both data sources and the data warehousing systems supplied by specific vendors and are difficult 
to extend to other installations or to be used with systems supplied by other vendors. 

SUMMARY OF THE INVENTION 

[08] There is a long-felt need for an asynchronous, non-invasive approach for automating the 
delivery of information from a variety of multiple data sources to a data warehouse. This and 
other needs are addressed by providing a sense-and-response framework that is capable of 
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receiving events in the process (e.g. data refreshing and other changes in data stores) and 
initiating appropriate and specific tasks in response to the routed events. Execution of the 
scheduled tasks may in turn generate additional events that are sensed and responded to until the 
process reaches an end point, e.g., delivered to a reporting system. 

[09] In one embodiment, this process automation can be implemented as an application 
platform capable of handling platform heterogeneity (i.e., handling multiple environments), task 
heterogeneity (i.e., several genres of task services such as file, extraction, transformation, 
loading, and database services), and time heterogeneity (i.e., handling multiple handshake points 
and periodicities of data availability). 

[10] The non-invasive process automation provided by embodiments of the present invention 
can handle process automation both horizontally across different heterogeneous environments 
but also vertically across different granularity levels, which may include inter-stage (between two 
distinct logical stages), and intra-stage (several events within a logical stage), and intra-task 
(checking for specific values within data). Furthermore, the automation of the various 
heterogeneous tasks can be exposed in the user interface along with clearly identified system 
handshake points. This allows for the process automation to be visualized clearly. 

[11] In one implementation, methods for the sense-and-response framework can be coded 
using web-services and/or the JAVA™ progKunming language to provide internal extensibility 
and external handshaking. An orchestration editor can also be provided to allow the user to 
design the automated process in a flexible and intuitive manner. Furthermore, sophisticated 
control, monitoring, playback, and exception handling can be provided, so that users can replay 
and debug the process automation. 

[12] For example, one embodiment of the present includes a distributed system for process 
automation comprising a computer-readable medium, an event router component, a means to 
start component, a conductor component and distributed services agents. A memory, storage, or 
other computer-readable medium store a representation of the process as a plurality of nodes and 
a plurality of directed links, in which each node has a plurality of attributes, each directed link 
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connect two of the nodes and represents the sequencing and possibly data dependency between 
the two of said nodes, and the nodes including start nodes. The event router component receives 
status transition events for said nodes and dispatching said events to said conductor and 
distributed services agents components. The means to start component prx)duces *ready2go' 
events for said plurality of start nodes. The conductor component handles *i^ady2go' and 
^completed' events by generating *inprocess* events in response to *ready2go' events and 
deciding when to produce additional 'ready2go' events for successor nodes in response to 
^completed' events. The distributed services agent handling 'inprocess* events by performing 
application work and producing ^completed' or *error' events. 

[13] This embodiment of the present invention represents advances over workflow systems 
implemented by the present inventors to automate processes involving human agents and 
machine components. For example, Ivanov (U.S. 5,706,452) describes a system in which the 
workflow process is represented using roles, stages, and directed links, with attributes associated 
to roles and stages. Advancement to the next review stage is done when a general criterion to 
finalize the stage is satisfied. In this embodiment, the workflow process is represented using 
nodes and directed links, and advancement to the next task is done when the "AND logic 
condition'' is satisfied, in a sense and response manner. 

[14] As another example, a proprietary system built by Amamath employed the concept of 
meta routing. The meta routes defined the allowed state transitions, but not within the sense and 
response framework that this embodiment uses. Furthermore, all tasks were executed from within 
the confines of a single database, as opposed to the distributed architecture of the present 
embodiment. Finally, implementations of the present embodiment include process based error 
handling. 

[15] Still other aspects, features, and advantages of the present invention are leadily apparent 
firom the following detailed description, simply by illustrating a number of particular 
embodiments and implementations, including the best mode contemplated for carrying out the 
present invention. The present invention is also capable of other and different embodiments, and 
its several details can be modified in various obvious respects, all without departing from the 
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spirit and scope of the present invention. Accordingly, the drawings and description are to be 
regarded as illustrative in nature, and not as restrictive. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[16] The present invention is illustrated by way of example, and not by way of limitation, in 
the figures of the accompanying drawings and in which like reference numerals refer to similar 
elements and in which: 

[17] FIG. 1 is an architectural diagram of one embodiment of the present invention. 

[18] FIG. 2 presents a sample Directed Acyclic Graph (DAG) used to define flows processed 
by the method of this invention. 

[19] FIG. 3 presents the conditional execution of nodes. 

[20] FIG. 4 depicts a computer system that can be used to implement an embodiment of the 
present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

[21] A system, method, and software for process automation are described. In the following 
description, for the purposes of explanation, numerous specific details are set forth in order to 
provide a thorough understanding of the present invention. It is apparent, however, to one skilled 
in the art that the present invention may be practiced without these specific details or with an 
equivalent arrangement. In other instances, well-known structures and devices are shown in 
block diagram form in order to avoid unnecessarily obscuring the present invention. 

Architectural Overview 
[22] FIG. 1 is an architectural diagram of a process automation system according to one 
embodiment of the present invention. A core component of the process automation system is a 
server component 114, which communicates with other components to coordinate and perform 
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the tasks and services for implementing an automated process. The other components, with 
which the server component 1 14 communicates, may include a client browser 103, an admin 
browser 105, a repository 107, and a distributed services agent 109. 

[23] The client browser 103 presents a graphical user interface (GUI) for allowing a user to 
create, edit, monitor, execute, and playback processes. 

[24] The admin browser 105 provides a browser-based management tool for viewing and 
updating the administrative data for the automated process and other configuration management 
information. The admin browser 105 may also provide security functions and allow for entry and 
modification of relatively static (e.g. long term) information about the automated processes being 
handled by the automation system. 

[25] The repository 107 includes a persistent, non-volatile memory such as a database or file 
system for storing the configuration information, metadata, and process logic for the automated 
processes as well as state information (including service events and orchestration edits). 

[26] Conamunication between the client 103 and server 1 14, and between the server 1 14 and 
the distributed services agent 109 takes place via a web services interface 111, which envelops 
each of the three components. In the preferred embodiment of this invention, the web services 
interface 111 provides the marshalling and un-marshalling of data that is being passed around, 
encryption of the data stream to ensure data security, and the actual transport. 

[27] The server component 1 14 includes different subsystems, among which an orchestration 
engine 1 13, an edits module 120, and a plug-ins module 1 19. These subsystems serve to 
encapsulate distinct functions of different parts of the process automation system. 

[28] The distributed services agent 109 is capable of spawning tasks to accomplish specific 
operations in a heterogeneous environment in response to service initiation messages received 
from the plug-ins module 119 of the server 114 component. A distributed services agent 109 
may include modules for handling file services (e.g., deleting, concatenating, copying, renaming, 
moving files within a directory or file system), Structured Query Language (SQL) services (e.g., 
formulating and submitting relational database system conmiands, statements, and queries to a 
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relational database or other implementation of the data warehouse and processing the results 
received in response), time services (e.g., scheduling completion of a process for a given time), 
operating system (OS) ser/iccs (e.g. executing batch scripts), and analytic services (e.g. 
interfacing with a data analysis program such as ESSBASE™). Distributed services agents 109 
may execute on different machines over a network. Service events generated from performing 
the various service tasks are transmitted back to the event router 118 within the orchestration 
engine 113 for further processing. 

[29] The orchestration engine 1 13 provides real time event publication to the client browser 
103 for visualizing the execution of the process. This visualization allows the user to 
inmiediately grasp the state of the overall process. For example, currently executing events or 
tasks can be displayed as icons with a green background, while tasks with an error condition are 
shown with a red background. 

[30] The orchestration engine 113 communicates directly with the repository 107. This 
information may include metadata edits and auditing information to register within the repository 
107 for storing metadata, entitlement, version, agent, and other relatively stable information. 
Shorter-term information, such as recorded service events and orchestration edits, may be kept in 
an orchestration repository component of the repository 107. 

[31] In one implementation, the orchestration engine 1 13 has three major components, a 
monitor 1 15, an event router 1 18, and a conductor 1 17. The monitor 1 15 is responsible for 
initiating the execution of a flow and performing house-keeping routines, such as checking if 
executing tasks are taking longer than the historical average for the respective tasks. The monitor 
115 generates events which are then passed to the event router 1 18. In the preferred embodiment 
of this invention, the events are routed to the respective processors using a set of meta routes 
which do not depend on the topology of the process flow. The routes arc static and depend only 
on the component types being assembled in a flow. The conductor 1 17 is the part that 
orchestrates the process flow. The conductor 117 receives events from the event router 118 and, 
in response, generates other events and delivers the generated events back to the event router 
118. The event router 118 is the module that receives events from the monitor 1 15, the 
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distributed services agent 109, and conductor 117, and dispatches the events to their handlers, 
namely the conductor 1 17, and plug-ins module 1 19. In turn, the plug-ins module 1 19 relays the 
event to the distribuied services agent 109 as a service initiation request. When the invoked 
service completes or reports an error condition, it generates a service event which is delivered to 
the event router 118. 

[32] The orchestration engine 113 communicates real time events to all clients 103 that have 
subscribed to a particular set of events. It also saves the events and audit data to the repository 
107. 

[33] A preferred embodiment of the process automation system implements various advanced 
features. One feature is domain separation and security entitlements, which allow processes to 
run in independent domains. For example, developers can have an individual sandbox for 
creating and testing new process flows independently of other developers. Within a given 
domain, only the latest version of a flow is active; all others are phased out as they complete. 
Different privileges and entitlements (for example, to create, monitor, and/or administrate 
different processes) can be assigned on a domain and/or user basis. 

[34] Side-by-side versioning is another feature that can be implemented with an embodiment 
of the present invention to allow modifications to existing process flows to be introduced into a 
production environment without affecting existing flows, thereby phasing in a version of the 
automated processes with little disraption. 

[35] Furthermore, specialized nodes can be provided to enhance the functionality of the 
system. For example, boundary nodes can be used to allow processes to have different 
periodicities on the same orchestration graph. Parameter feeder nodes can allow values to be fed 
into the process at any point. 

Operational Overview 
[36] One implementation of the process automation system uses directed acyclic graphs 
(DAGs) to describe process flows (see FIG. 2). In such a process flow graph topology, a node 



Attorney Docket No.: 01035-1001 Patent 

describes a process and a link describes a dependency. The intrinsic semantics of such a 
structure, referred to as "the AND logic condition," is that a node can be executed if and only if 
all Its predecessors have been successfully executed. Each node in the process flow graph has 
status information associated with it. One implementation of the process automation system uses 
the following statuses: 'notreached', 'readj^go', 'inprocess', 'completed', 'pseudo-completed', 
'cachedV'blocked', 'error', 'warning'. 

[37] The events sensed and responded to include status chan^ in any of the nodes that are 
part of the process flow. The monitor 115 generates start events as follows: when there is no 
process that is active in a node section, the monitor 115 determines which nodes are the start or 
blocked nodes and changes the status of those nodes to 'ready2go' or 'inprocess' respectively, in 
effect starting or restarting the process flow. The conductor 1 17 is notified by the event router 
118 of status changes to •ready2go', 'completed', or 'pseudo-completed*. Upon receiving a 
'ready2go' ev«it, the conductor 117 generates an 'inprocess* event if the node is active, 
otherwise a 'completed' event. 

[381 'Inprocess' status changes (events) are received by the event routo: 118 and are routed to 
the plug-ins module 1 19, which contains a services stub for each type of node in the DAG. For 
example, a 'filewatch' node is handled by a 'filewateh' processor. An event handler pn)cessor 
can either process the service request locally or invoke the distributed services agent 109 to 
perform the actual work remotely. 

[39] Whenever there is an error in any node, the status of the node is changed to 'error' . 
Normally, the 'error' status is not changed automatically and a node can stay in this status 
indefinitely. 

[40] Nodes in a process flow can be tagged as boundary nodes. The set of boundary nodes 
define flow sections (slices). The semantics of a flow with two sections is that at any moment 
tiiere can be two active instances of the process flow. The goal is to overiap time consuming 
operations to improve speed and the set of boundary nodes are used to compute these sections. 
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[41] In order to enforce a "one flow instance active within a section at any time" constraint, 
when boundary nodes are reached, their status is changed to *ready2go' if the next section is not 
active (there is no active node within the section). If the section is active then the new status of 
the boundary node is blocked (status is •cached'). It is the responsibility of the monitor 1 15 to 
revive blocked nodes, that is, whenever the section has become inactive, to change the status of 
the boundary node to *ready2go\ 

[42] The process automation system allows conditional execution of nodes. Upon evaluation 
of some condition, the process automation system decides whether to execute any of the 
successor nodes. To allow subsequent execution path convergence (see FIG. 2), nodes that are 
not executed because the condition was not met have their status changed immediately to 
*pseudo-completed'. In FIG. 3, if the status of the non executed node B2 were not changed to 
•pseudo-completed', node C would not be executed since in order to execute any node, all its 
predecessors would need to be ^completed'. Thus, in tiie AND logic condition statuses, 
*completed* and *pseudo-completed' are equivalent. 

[43] Each type of node in the process flow graph has a set of attributes associated with it. For 
example, a *filewatch' node has an attribute named "agent" which specifies the (possibly remote) 
machine on which the file is being watched, "file" which specifies the full path and file name of 
the file being watched. The process automation system allows both the hard coding of attributes 
and substitution at run time. There is a special type of node which generates substitution 
parameters via various mechanisms (hard coded or by evaluating a database query). 

[44] Attributes of a node can be stored in the repository 107 as strings. Individual processors 
convert each attribute to its proper type before utilizing it. Prior to using an attribute, its 
expression is evaluated and all parameters are substituted. TTie parameters to be substituted are 
distinguished syntactically within the attribute string. One embodiment uses the syntax 
<%=parameterName%>. The parameters available to a node are all parameters that were 
produced by all nodes preceding the node. In case of a name clash, tiie value of the most recentiy 
computed parameter is used. 
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[45] The process automation system allows the grouping of a node set into process sub-flows. 
Regular process sub-flows are similar to in-line routines from regular programming languages. 
Tney are defined once and used once at the location they are defined. Regular sub-flows arc thus 
merely a way to better organize a process flow (e.g., for viewing purposes). 

[46] Subroutine sub-flows are defined in one place and can be used in many diffeirat 
locations. They arc invoked via a special node ('subcall*) which identifies the sub-flows to be 
invoked and the parameters to be passed. The 'subcall' node can invoke multiple subroutine sub- 
flows, and multiple instances of the same subroutine sub-flow at a time. 

[47] Finally, an error handler sub-flow is also defined in one place and can be used in many 
locations. The manner in which an error handler flow is invoked differs from a subroutine flow: 
when the status of a node changes to 'error' the name of one error handling flow is computed 
from the "err_handler" attribute and the respective flow is invoked. Upon completion of the 
error handling flow, one embodiment allows two strategies: (1) to consider the invoking node 
•completed' and pass it a return code which in mm could be used in conditional links, and (2) to 
restart the flow fix)m one or more predecessor nodes, resetting all stattises of intermediate nodes 
to 'notreached' and undoing all respective parameter bindings. 

[48] In order to prevent double entry of attribute values, the preferred embodiment allows one 
node to use attributes defined in predecessor nodes. For example, the 'file concatenate' node 
does not use attributes to specify which files should be concatenated. Rather, it uses the attributes 
in all predecessor 'file' nodes, in effect using the Hnks not only as dependency links but as data 
links as well. However, in some circumstances one may desire to use a link only as a dependency 
link. The preferred embodiment of this invention adds the 'pass_data' attribute to a link to 
specify whether the link is a dependency only or a dependency and data link. 

[49] As a node is executed, its average execution time is monitored over aU the iterations 
available in the repository. At the node level, the flow designer can specify a percentage 
threshold A for the execution. If traditionally a node executes in X seconds and its current 
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execution takes more than (1+A) * X seconds then the node is flagged with an *TExecution time 
exceeded historical limits" warning. 

[50] When a client 103 enables monitor mode, the client 103 registers itself with the 
orchestration engine 1 13 as a listener. From now until the client 103 no longer accepts events 
(either because the client 103 switched off monitor mode or because the user has closed the client 
103), the orchestration engine 113 notifies the client 103 of any change in status events. The 
client changes the colors of the respective nodes and updates the run time attributes accordingly, 

[51] For both historical and operations reasons, events that occur in the process automation 
system are stored in the repository 107. This set of events is used in playback mode to visualize 
the execution of past events. 



Discussion of tedb Process Axjtomation Routines 
[52] Referring to the pseudocode in the appendix, routine TEAMonitor, which corresponds to 
the monitor 1 15 processor, is invoked periodically and performs the following functions: (1) for 
all nodes that are start nodes in the main flow check to see if the section that contains the node 
itself is already active. Include the node in the set of nodes checked. If false then activate the 
section via the node using a new run numbw and *ready2go' as the new status. (2) for all nodes 
that are blocked (in other words, the execution prerequisite is met), but were blocked because the 
section was already active try to unblock them: if the section which contains the node (and 
excluding the node itself) is not active then activate the section by changing the status to 
*ready2go' but use the blocked run number. 

[53] Routine TEAStarter, which is part of the conductor 1 17 processor, is invoked when the 
node status becomes •ready2go' and performs the following functions: if the "hold" attribute of 
the node is TRUE then do nothing else if the "active" attribute of the node is TRUE then change 
the status of the node to *inprocess\ otherwise change the status to •completed'. In both cases do 
not change the run number. Routine TEAStateMachine, also part of the conductor 1 17 processor, 
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is invoked when the node status becomes 'completed' or 'pseudo-completed*. In turn, it tries to 
activate all node successors. 

[54] Routine tryToActivate has a node and a run number as input parameters. First, the 
tryToActivate routine determines the status into which the node would have to be activated, by 
calling activateToStatus. If the status to activate the node into is not 'dont.activate' then, if the 
no<te is not a boundary node then the status of node is changed to that status to activate into. If 
the node is a boundary node then, if the section that contains the node is akeady active then the 
node is blocked (change the status to 'cached' and the run number is not changed, but saved 
along with the potential status and run number for the unblock time. If the section is not active 
then the section is activated normally (via the node, with the status to activate into and the 
parameter run number). Routine activateToStatus takes the node and a nm number as input 
parameters and has a default return value of 'pseudo-completed'. The predecessors of the node 
are evaluated using the "AND logic condition." Jf any predecessor has a status that is not 
•completed' or 'pseudo-completed', then the node is not activated. Also, if there is a predecessor 
that is 'blocked' and the predecessor is linked with the node via a 'use_cached' link, then the 
node is not activated either. Otherwise, the status to activate into is either •ieady2go* or 'pseudo- 
completed' (default). If at least one predecessor via a regular link is 'completed', then the status 
to activate into is •ready2go*. Also, if at least one predecessor via a conditional link is 
'completed' and the condition criterion is met then the status to activate into is *ready2go'. 

[55] Routine sectionlsActive takes a node and a boolean as parameters. It first computes the 
section that contains the node based on the boundary nodes. For performance reasons the sections 
may be computed when the flow is saved to the repository. If the set is empty then the section is 
not active. If the boolean indicates that the node itself should not be included in the set, then the 
node is excluded from the set. If all the nodes in the set are 'notreached' or 'cached' but not 
'blocked' then the section is not active. If all the nodes in the set are 'completed' or 'pseudo- 
completed' the section in not active either. Otherwise the section is active. 

[56] Routine activateSection takes a node, a status, and a run number as parameters. If the run 
parameter indicates that a new run should be used, then a new run number is generated, and the 
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activation status is set to *ready2go\ If the run parameter indicates that the blocked run should be 
used, then the ran number and the activation status are retrieved firom the blocked record. Next, 
all nodes in the section are processed, the status of the node is changed to the activation status, 
and its ran number is changed to the activation ran number. The nodes in the section are 
processed as follows: nodes on the boundaries have status changed to 'cached* (the ran number is 
not changed, the current status is saved, but the current ran number is not - cached but not 
blocked); and intemal nodes are cleared (status is changed to *notreached\ and the ran number is 
cleared). 

[57] The activateSublnstance routine is part of the distributed services agent 109, more 
specifically of the ^subcair processor. It simply activates all start nodes of that sub-flow with the 
specified instance number. 

Hardware Overview 

[58] FIG. 4 illustrates a computer system 400 upon which an embodiment according to the 
present invention can be implemented. The computer system 400 includes a bus 401 or other 
communication mechanism for conmiunicating information and a processor 403 coupled to the 
bus 401 for processing information. The computer system 400 also includes main memory 405, 
such as a random access memory (RAM) or other dynamic storage device, coupled to the bus 401 
for storing information and instractions to be executed by the processor 403. Main memory 405 
can also be used for storing temporary variables or other intermediate information during 
execution of instractions by the processor 403. The computer system 400 may further include a 
read only memory (ROM) 407 or other static storage device coupled to the bus 401 for storing 
static information and instractions for the processor 403. A storage device 409, such as a 
magnetic disk or optical disk, is coupled to the bus 401 for persistently storing information and 
instractions. 

[59] The computer system 400 may be coupled via the bus 401 to a display 411, such as a 
cadiode ray tube (CRT), liquid crystal display, active matrix display, or plasma display, for 
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displaying information to a computer user. An input device 413, such as a keyboard including 
alphanumeric and other keys, is coupled to the bus 401 for communicating information and 
command selections to the processor 403. Another type of user input device is a cureor control 
415, such as a mouse, a trackball, or cursor direction keys, for conmiunicating direction 
information and command selections to the processor 403 and for controlling cursor movement 
on the display 411. 

[60] According to one embodiment of the invention, process automation is provided by the 
computer system 400 in response to the processor 403 executing an arrangement of instructions 
contained in main memory 405. Such instructions can be read into main memory 405 ftom 
another computer-readable medium, such as the storage device 409. Execution of the 
arrangement of instructions contained in main memoiy 405 causes the processor 403 to perform 
the process steps described herein. One or more processors in a multi-processing an^gement 
may also be employed to execute the instructions contained in main memory 405. In alternative 
embodiments, hard-wired circuitry may be used in place of or in combination with software 
instructions to implement the embodiment of the present invention. In another example, 
reconfigurable hardware such as Field Programmable Gate Arrays (FPGAs) can be used, in 
which the functionality and connection topology of its logic gates are customizable at run-time, 
typically by programming memory look up tables. Thus, embodiments of the present invention 
are not limited to any specific combination of hardware circuitry and software. 

[61] The computer system 400 also includes a communication interface 417 coupled to bus 
401. The communication interface 417 provides a two-way data conomunication coupling to a 
network Unk 419 connected to a local network 421. For example, the communication interface 
417 may be a digital subscriber line (DSL) card or modem, an integrated services digital network 
(ISDN) card, a cable modem, a telephone modem, or any other communication interface to 
provide a data communication connection to a corresponding type of communication line. As 
another example, communication interface 417 may be a local area network (LAN) card (e.g. for 
Ethernet™ or an Asynchronous Transfer Model (ATM) network) to provide a data 
communication connection to a compatible LAN. Wireless links can also be implemented. In 
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any such implementation, communication interface 417 sends and receives electrical, 
electromagnetic, or optical signals that carry digital data streams representing various types of 
information. Further, the conmiunication interface 417 can include peripheral interface devices, 
such as a Universal Serial Bus (USB) interface, a PCMCIA (Personal Computer Memory Card 
Litemational Association) interface, etc. Although a single communication interface 417 
depicted in FIG. 4, multiple communication interfaces can also be employed. 

[62] The network link 419 typically im)vides data communication through one or more 
networks to other data devices. For example, the network link 419 may provide a connection 
through local network 421 to a host computer 423, which has connectivity to a network 425 (e.g. 
a wide area network (WAN) or the global packet data communication networic now commonly 
referred to as the "Intemet") or to data equipment operated by a service provider. The local 
network 421 and the network 425 both use electiical, electiiomagnetic, or optical signals to 
convey information and instructions. The signals through the various networks and die signals 
on the network link 419 and through the communication interface 417, which communicate 
digital data with the computer system 400, are exemplary forms of carrio: waves bearing the 
information and instructions. 

[63] The computer system 400 can send messages and receive data, including program code, 
tiirough the network(s), the network link 419, and the communication interface 417. In die 
Intemet example, a server (not shown) might tiansmit requested code belonging to an application 
program for implementing an embodiment of the present invention through the networic 425, the 
local network 421 and the communication interface 417. The process^ 403 may execute the 
transmitted code while being received and/or store the code in the storage device 409, or other 
non-volatile storage for later execution. In tiiis manner, tiie computer system 400 may obtain 
application code in the form of a carrier wave. 

[64] The term "computer-readable medium" as used herein refers to any medium that 
participates in providing instiiictions to die processor 405 for execution. Such a medium may 
take many forms, including but not limited to non-volatile media, volatile media, and 
tiansmission media. Non-volatile media include, for example, optical or magnetic disks, such as 
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the storage device 409. Volatile media include dynamic memory, such as main memory 405. 
Transmission media include coaxial cables, copper wire and fiber optics, including the wires that 
comprise the bus 401. Transmission media can also take the fonn of acoustic, optical, or 
electromagnetic waves, such as those generated during radio frequency (RF) and infrared (IR) 
data communications. Common fonns of computer-readable media include, for example, a 
floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, 
CDRW, DVD, any other optical medium, punch cards, paper tape, optical marie sheets, any other 
physical medium with patterns of holes or other optically recognizable indicia, a RAM, a PROM, 
and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave, or any 
other medium from which a computer can read. 

[65] Various forms of computer-readable media may be involved in providing instructions to a 
processor for execution. For example, the instructions for carrying out at least part of the present 
invention may initially be borne on a magnetic disk of a remote computer. In such a scenario, the 
remote computer loads the instructions into main memory and sends the instructions over a 
telephone line using a modem. A modem of a local computer system receives the data on the 
telephone line and uses an infrared transmitter to convert the data to an infiaied signal and 
transmit the infiared signal to a portable computing device, such as a personal digital assistant 
(PDA) or a laptop. An infrared detector on the portable computing device receives the 
information and instructions borne by the infrared signal and places the data on a bus. The bus 
conveys the data to main memory, from which a processor retrieves and executes the 
instructions. The instructions received by main memory can optionally be stored on storage 
device either before or after execution by processor. 

Pseudocode Appendk 
[66] TEAMonitor // invoked p^odically 

[67] 

[68] begin 
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[69] forall (node) in StartNodesSet(null) do // top level start nodes only 

[70] if not sectionis Acti ve(node, TRUE) then acti vateSection(node, USE_NEW) 

[71] end forall 

[72] forall (node) in BlockedNodesSetQ do 

[73] if not sectionIsActive(node, FALSE) then activateSection(node, ", USE_BLOCKED) 
[74] end forall 
[75] end 
[76] 

[77] TEAStarter(node) // invoked when node.status becomes READY2GO 

[78] 

[79] begin 

[80] if not node.hold then 

[81] if node.active then saveStatus(node, TNPROCESS', RUN_DONT_CHANGE, 
node.status, node.run) 

[82] else saveStatus(node, 'COMPLETED', RUNJX)NT_CHANGE, nodcstatus, node.run) 
[83] end if 
[84] end 
[85] 

[86] activateSubInstance(caller, instance, run, pars) // invoked when a SUBFLOW node is 
executed 

[87] 

[88] begin 

[89] forall start in StartNodesSet(caller, instance) do 
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[901 activateSection(start, •READY2G0', run) 

[91] endforall 

[92] end 
[93] 

[94] TEAStateMachine(node, run) // invoiced when node.status becomes 'COMPLETED' or 
•PSEUDO-COMPLETED' 

[95] 

[96] begin 

[97] forall successor in Successors(node) do 

[98] tryToActivateCsuccessor, run) 

[99] endforaU 

[100] end 
[101] 

[102] activateSection(node, sta, run) 

[103] 

[104] begin 

[105] ifrun=USEJ^Wtlienrun = newRunO;sta = 'READY2GO' 

[106] if run=USE_BLOCKED then run = blockedRun(node); sta = blockedSta(node) 

[107] forall sec_node in Section(node) - {node} do 

[108] if node.status != " and sec_node.isBoundaiy then 

[109] saveStatus(sec_node, 'CACHED', RUN_DONT_CHANGE, sta, RUN_CLEAR) 

[110] else 
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[111] saveStatus(sec_node, RUN_CLEAR, STA.CLEAR, RUN_CLEAR) 
[112] endfoiall 

[113] saveStatus(node, sta, ran, STA^CLEAR, RUN.CLEAR) 

[114] end 

[115] 

[116] sectionIsActive(node, includeNode) 

[117] 

[118] begin 

[119] set = Section(node) 

[120] if set = NULL then return FALSE 

[121] if not includeNode then set = set - {node} 

[122] if set = NULL then return FALSE 

[123] if aU nodes in set are (" or CCACHED' but not bloclced)) then return FALSE 

[124] if aU nodes in set are 'COMPLETED' or PSEUDO-COMPLETED' then return FALSE 

[125] return TRUE 

[126] end 

[127] 

[128] tryToActivate(node, ran) 

[129] 

[130] begin 

[131] if (toStatus = activateToStatus(node, ran)) != 'DONT_ACnVATE' then 
[132] if not isBoundaiyNode(node) then 
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[133] saveStatus(node, toStatus, run, STA.CLEAR, RUN_CLEAR) 

[134] else // node is a boundary node 

[135] if sectionIsActive(node, FALSE) then 

[136] saveStatus(node, 'CACHED', RUNJ)ONT^CHANGE, toStatus, run) // bloclc 
[137] else 

[138] activateSection(node, toStatus, run) 

[139] end if 

[140] end if 

[141] end if 

[142] end 

[143] 

[144] activateToStatus(node, run) 

[145] 

[146] begin 

[147] toStatus = TSEUDO^OMPLETED* 

[148] forall pred in Predecessors(node) do 

[149] if not (pted.status = 'COMPLETED' or 

[150] pred.status = 'PSEUDO-COMPLETED' or 

[151] (pred.status = 'CACHED' and pi«d.Unk.flow_type = USE_CACHED) 

[152] ) then return 'DONT_ACTIVATE' 
[153] if pred.lin]c_type = REGULAR then 

[154] if pred.status = 'COMPI£TED' then toStatus = •READY2GO' 
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[155] else // pred.Unk_type = VALUE 

[156] if pred(nin).val = pred.link.val then toStatus = 'READY2G0' 
[157] end if 
[158] endforall 
[159] return toStatus 
[160] end 

Conclusion 

[161] While the present invention has been described in connection with a number of 
embodiments and implementations, the present invention is not limited but covers various 
obvious modifications and equivalent arrangements, which fall within the purview of the 
appoided claims. 
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